The Yahoo Query Treebank, V. 1.0

نویسندگان

  • Yuval Pinter
  • Roi Reichart
  • Idan Szpektor
چکیده

This dataset release accompanies Pinter et al. (2016) which describes the motivation and grammatical theory. Please cite that paper when referencing the dataset. The dataset may be accessed via the Yahoo Webscope homepage1 under Linguistic Data as dataset L-28. The description in Section 2 is included within the dataset as a Readme. The dataset is sure to have annotation errors which are not covered by the special cases specified in this document. Please approach the first author for any corrections and they will appear in the next release. See Section 4 for known errors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Netgraph Query Language for the Prague Dependency Treebank 2.0

We study the annotation of the Prague Dependency Treebank 2.0 (PDT 2.0) and assemble a list of requirements on a query language that would allow searching for and studying all linguistic phenomena annotated in the treebank. We propose an extension to the query language of an existing search tool Netgraph 1.0 and show that the extended query language satisfies the list of requirements. We demons...

متن کامل

Does Netgraph Fit Prague Dependency Treebank?

On many examples we present a query language of Netgraph – a fully graphical tool for searching in the Prague Dependency Treebank 2.0. To demonstrate that the query language fits the treebank well, we study an annotation manual for the most complex layer of the treebank – the tectogrammatical layer – and show that linguistic phenomena annotated on the layer can be searched for using the query l...

متن کامل

Searching in the Penn Discourse Treebank Using the PML-Tree Query

The PML-Tree Query is a general, powerful and user-friendly system for querying richly linguistically annotated treebanks. The present paper shows how the PML-Tree Query can be used for searching for discourse relations in the Penn Discourse Treebank 2.0 mapped onto the syntactic annotation of the Penn Treebank.

متن کامل

Towards a Simple and Full-Featured Treebank Query Language

Netgraph query language is a query system for linguistically annotated treebanks that aims to be sufficiently powerful for linguistic needs and yet simple enough for not requiring any programming or mathematical skill from its users. We provide an introduction to the system along with a set of examples how to search for some frequent linguistic phenomena. We also offer a comparison to the query...

متن کامل

Active Learning for Building a Corpus of Questions for Parsing

This paper describes how we built a dependency Treebank for questions. The questions for the Treebank were drawn from questions from the TREC 10 QA task and from Yahoo! Answers. Among the uses for the corpus is to train a dependency parser achieving good accuracy on parsing questions without hurting its overall accuracy. We also explore active learning techniques to determine the suitable size ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1605.02945  شماره 

صفحات  -

تاریخ انتشار 2016